SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(1) APPLICANT: Sullivan, Francis 
Knz, Ron 
Kumar, Ravindra 

{11} TITLE OF INVENTION: Human GDP-Mannose Hydratase 

(in) NUMBER OF SEQUENCES . 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSES: Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATS: Massachusetts 

(E) COUNTRY : USA 

(F) ZIP: 02140 

<v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Fleecy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: 3rown, Sect; A. 

(B) REGISTRATION NUMBER: 32 , 724 

(C) REFERENCE/DOCKET NUMBER : GI5285 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8224 

(B) TELEFAX: (617) 87S-5851 



(2) INFORMATION FOR SEQ ID NO;l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCCGCGTTCC TGCCGGCACC GCGCCTGCCC TCTGCCGCGC TCCGCCCTGC CGCCGACCGC 60 

ACGCCCGCCG CGGGACATGG CACACGCACC GGCACGCTGC CCCAGCGCCC GGGGCTCCGG 120 

GGACGGCGAG ATGGGCAAGC CCAGGAACGT GGCGCTCATC ACCGGTATCA CAGGCCAGGA 180 
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TGGTTCCTAC CTGGCTGAGT TCCTGCTGGA GAAAGGCTAT GAGGTCCATG GAATTGTACG 240 

GCGGTCCAGT TCATTTAATA CGGGTCGAAT TGAGCATCTG TATAAGAATC CCCAGGCTCA 3 00 

CATTGAAGGA AACATGAAGT TG CACTATGG CGATCTCACT GACAGTACCT GCCTTGTGAA 36 0 

GATCATTAAT GAAGTAAAGC CCACAGAGAT CTACAACCTT GGAGCCCAGA GCCACGTCAA 420 

AATTTCCTTT GACCTCGCTG AGTACACTGC GGACGTTGAC GGAGTTGGCA CTCTACGACT 480 

TCTAGATGCA GTTAAGACTT GTGGCCTTAT CAACTCTGTG AAGTTCTACC AAGCCTCAAC 54 0 

AAGTGAACTT TATGGGAAAG TGCAGGAAAT ACCCCAGAAG GAGACCACCC CTTTCTATCC 60 0 

CCGGTCACCC TATGGGGCAG CAAAACTCTA TGCCTATTGG ATTGTGGTGA ACTTCCGTGA 660 

GGCGTATAAT CTCTTTGCAG TGAACGGCAT TCTCTTCAAT CATGAGAGTC CCAGAAGAGG 72 0 

AG CTAATTT C GTTACTCGAA AAATTAGCCG GTCAGTAGCT AAGATTTACC TTGGACAACT 78 0 

GGAATGTTTC AGTTTGGGAA ATCTGGATGC CAAACGAGAT TGGGGCCATG CCAAGGACTA 84 0 

TGTGGAGGCT ATGTGGTTGA TGTTGCAGAA TGATGAGCCG GAGGACTTCG TTATAGCTAC 90 0 

TGGGGAGGTC CATAGTGTCC GGGAATTTGT CGAGAAATCA TTCTTGCACA TTGGAAAAAC 960 

CATTGTGTGG GAAGGAAAGA ATGAAAATGA AGTGGGCAGA TGTAAAGAGA CCGGCAAAGT 102 0 

TCACGTGACT GTGGATCTCA AGTACTACCG GCCAACTGAA GTGGACTTTC TGCAGGGCGA 10 8 0 

CTGCACCAAA GCGAAACAGA AGCTGAACTG GAAGCCCCGG GTCGCTTTCG ATGAGCTGGT 114 0 

GAGGGAGATG GTGCACGCCG ACGTGGAGCT CATGAGGACA AACCCCAATG CCTGAGCAGC 120 0 

GCCTCGGAGC CCGGCCCGCC CTCCGGCTAC AATCCCCGCA GAGTCTCCGG TGCAGACGCG 1260 

CTGCGGGGAT GGGGAGCGGC GTGCCAATCT GCGGGTCCCC TGCGGCCCCT GCTGCCGCTG 1320 

CGCTGTCCCG GCCGCAAGAG CGGGGCCGCC CCGCCGAGGT TTGTAGCAGC CGGGATGTGA 13 8 0 

CCCTCCAGGG TTTGGGTCGC TTTGCGTTTG TCGAAGCCTC CTCTGAATGG CTTTGTGAAA 144 0 

TCAAGATGTT TTAATCACAT TCACTTTACT TGAAATTATG TTGTTACACA ACAAATTGTG 1500 

GGGCCTTCAA ATTGTTTTTC C 1521 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Lys Pro Arg Asn Val Ala Leu lie Thr Gly lie Thr Gly Gin 
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Asp Gly Ser Tyr Leu Ala Glu ?he Leu Leu Glu Lys Gly Tyr Glu Val 
20 25 30 

His Gly He Val Arg Arg Ser Ser Ser Phe Asn Thr Gly Arg He Glu 
35 40 45 

His Leu Tyr Lys Asn Pro Gin Ala His He Glu Gly Asn Met Lys Leu 

50 55 60 

His Tyr Gly Asp Leu Thr As? Ser Thr Cys Leu Val Lys He He Asn 
65 70 75 80 

Glu Val Lys Pro Thr Glu He Tyr Asn Leu Gly Ala Gin Ser His Val 
85 90 95 

Lys He Ser Phe Asp Leu Ala Glu Tyr Thr Ala Asp Val Asp Gly Val 
100 105 110 

Gly Thr Leu Arg Leu Leu Asp Ala Val Lys Thr Cys Gly Leu He Asn 
115 120 125 

Ser Val Lys Phe Tyr Gin Ala Ser Thr Ser Glu Leu Tyr Gly Lys Val 
130 135 140 

Gin Glu He Pro Gin Lys Glu Thr Thr Pro Phe Tyr Pro Arg Ser Pro 
145 150 155 160 

Tyr Gly Ala Ala Lys Leu Tyr Ala Tyr Trp He Val Val Asn Phe Arg 
165 170 175 

Glu Ala Tyr Asn Leu Phe Ala. Val Asn Gly He Leu Phe Asn His Glu 
180 185 190 

Ser Pro Arg Arg Gly Ala Asn Phe Val Thr Arg Lys He Ser Arg Ser 
195 200 205 

Val Ala Lys He Tyr Leu Gly Gin Leu Glu Cys Phe Ser Leu Gly Asn 
210 215 220 

Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val Glu Ala 
225 230 235 240 

Met Trp Leu Met Leu Gin Asn Asp Glu Pro Glu Asp Phe Val He Ala 
245 250 255 

Thr Gly Glu Val His Ser Val Arg Glu Phe Val Glu Lys Ser Phe Leu 
260 265 270 

His He Gly Lys Thr He Val Trp Glu Gly Lys Asn Glu Asn Glu Val 
275 230 285 

Gly Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp Leu Lys 
290 295 300 

Tyr Tyr Arg Pro Thr Glu Val Asp Phe Leu Gin Gly Asp Cys Thr Lys 
305 310 315 320 

Ala Lys Gin Lys Leu Asn Trp Lys Pro Arg Val Ala Phe Asp Glu Leu 
325 330 335 
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Val Arg Glu Met Val His Ala Asp Val Glu Leu Met Arg Thr Asn Pro 
340 345 350 



Asn Ala 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY . linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Ala His Ala Pro Ala Arg Cys Pro Ser Ala Arg Gly Ser Gly Asp 
15 10 15 

Gly Glu Met Gly Lys Pro Arg Asn Val Ala Leu He Thr Gly He Thr 
20 25 30 

Gly Gin Asp Gly Ser Tyr Leu Ala Glu Phe Leu Leu Glu Lys Gly Tyr 
35 40 45 

Glu Val His Gly He Val Arg Arg Ser Ser Ser Phe Asn Thr Gly Arg 
50 55 60 

He Glu His Leu Tyr Lys Asn Pro Gin Ala His He Glu Gly Asn Met 
65 70 75 80 

Lys Leu His Tyr Gly Asp Leu Thr Asp Ser Thr Cys Leu Val Lys He 
85 90 95 

He Asn Glu Val Lys Pro Thr Glu He Tyr Asn Leu Gly Ala Gin Ser 
100 105 110 

His Val Lys He Ser Phe Asp Leu Ala Glu Tyr Thr Ala Asp Val Asp 
115 120 125 

Gly Val Gly Thr Leu Arg Leu Leu Asp Ala Val Lys Thr Cys Gly Leu 
130 135 140 

He Asn Ser Val Lys Phe Tyr Gin Ala Ser Thr Ser Glu Leu Tyr Gly 
145 150 155 160 

Lys Val Gin Glu He Pro Gin Lys Glu Thr Thr Pro Phe Tyr Pro Arg 
165 170 175 

Ser Pro Tyr Gly Ala Ala Lys Leu Tyr Ala Tyr Trp He Val Val Asn 
180 185 190 

Phe Arg Glu Ala Tyr Asn Leu Phe Ala Val Asn Gly He Leu Phe Asn 
195 200 205 

His Glu Ser Pro Arg Arg Gly Ala Asn Phe Val Thr Arg Lys He Ser 
210 215 220 
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Arg Ser Val Ala Lys lie Tyr Leu Gly Gin Leu Glu Cys Phe Ser Leu 
225 230 235 240 

Gly Asn. Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val 
245 2S0 255 

Glu Ala Met Trp Leu Met Leu Gin Asn Asp Glu Pro Glu Asp Phe Val 
260 265 270 

lie Ala Thr Gly Glu Val His Ser Val Arg Glu Phe Val Glu Lys Ser 
275 280 285 

Phe Leu His He Gly Lys Thr He Val Trp Glu Gly Lys Asn Glu Asn 
290 295 300 

Glu Val Gly Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp 
305 310 315 320 

Leu Lys Tyr Tyr Arg Pro Thr Glu Val Asp Phe Leu Gin Gly Asp Cys 
325 330 335 

Thr Lys Ala Lys Gin Lys Leu Asn Trp Lys Pro Arg Val Ala Phe Asp 
340 345 350 

Glu Leu Val Arg Glu Met Val His Ala Asp Val Glu Leu Met Arg Thr 
355 360 365 



Asn Pro Asn Ala 
370 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligoucleotide 



- - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

TGATGAGCCA GAGGACTTTG TCATAGCTAC 3 0 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligoucleotide 



18 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S: 
CAGAAAGTCC ACTTCAGTCG GTCGGTAGTA 



30 
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